Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals
نویسندگان
چکیده
This paper presents a novel low-complexity method for extracting periodicity of signals based on their sine-wave representation. In this representation, the signal is modeled as a finite sum of sine-waves, with time-varying amplitudes, phases and frequencies. We describe how one can modify the familiar spectral-comb analysis method to obtain a guaranteed and effective procedure to find the fundamental-frequency which gives the best harmonic approximation of the signal spectrum. The search is efficiently carried out in the frequency domain. The procedure obtains a successive refinement of possible pitch values which are consistent with an increasing number of sine wave components. Other pitch intervals are pruned at an early stage of the search. The advantage of this algorithm is its high accuracy achieved at a relatively low complexity. We also briefly describe one possible application in the area of pitch determination of speech signals.
منابع مشابه
Accurate Short-term Analysis of the Fundamental Frequency and the Harmonics-to-noise Ratio of a Sampled Sound
We present a straightforward and robust algorithm for periodicity detection, working in the lag (autocorrelation) domain. When it is tested for periodic signals and for signals with additive noise or jitter, it proves to be several orders of magnitude more accurate than the methods commonly used for speech analysis. This makes our method capable of measuring harmonics-to-noise ratios in the lag...
متن کاملPitch Marking Based on an Adaptable Filter and a Peak-Valley Estimation Method
In a text-to-speech (TTS) conversion system based on the time-domain pitch-synchronous overlap-add (TD-PSOLA) method, accurate estimation of pitch periods and pitch marks is necessary for pitch modification to assure an optimal quality of the synthetic speech. In general, there are two major issues on pitch marking: pitch detection and location determination. In this paper, an adaptable filter,...
متن کاملSpeech reconstruction from mel frequency cepstral coefficients and pitch frequency
This paper presents a novel low complexity, frequency domain algorithm for reconstruction of speech from the melfrequency cepstral coe cients (MFCC), commonly used by speech recognition systems, and the pitch frequency values. The reconstruction technique is based on the sinusoidal speech representation. A set of sine-wave frequencies is derived using the pitch frequency and voicing decisions, ...
متن کاملWavelet based feature extraction for phoneme recognition
In an effort to provide a more efficient representation of the acoustical speech signal in the pre-classification stage of a speech recognition system, we consider the application of the Best-Basis Algorithm of Coifman and Wickerhauser. This combines the advantages of using a smooth, compactly-supported wavelet basis with an adaptive time-scale analysis dependent on the problem at hand. We star...
متن کاملShape invariant pitch modification of speech using a harmonic model
We present a simple but e ective approach to pitch modi cation of speech based on a harmonic model. Building on our time-scaling algorithm [1], pitch modi cation applies to a harmonically coded glottal wave estimate derived via a simple inverse ltering technique [3]. The modi ed glottal wave subsequently serves as input to an LPC vocal tract lter and the pitch-scaled speech is generated. Shape ...
متن کامل